Expressive Text-to-Image Generation with Rich Text